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(54) Title: TEST AND MODEL FOR ALZHEIMER'S DISEASE 



(57) Abstract 

Model systems of Alzheimer's disease comprise a DNA sequence encoding an amy- 
loid precursor protein (APP) isoform or fragment that has an amino acid substitution. The 
substituted amino acid may be other than valine at the amino acid position corresponding 
to amino acid residue position 717 of APP770. Methods of determining genetic predisposi- 
tion to Alzheimer's disease are also disclosed. 
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TEST AND MODEL FOR ALZHEIMER'S DISEASE 

BACKGROUND OF THE INVENTION 
Alzheimer's disease is a progressive disease known generally as 
5 senile dementia. Broadly speaking the disease falls into two categories, namely 
late onset and early onset. Late onset, which occurs in old age (65 + years), may 
be caused by the natural atrophy of the brain occurring at a faster rate and to a 
more severe degree than normal. Early onset Alzheimer's disease is much more 
infrequent but shows a pathologically identical dementia with diffuse brain atrophy 
10 . which develops well before the senile period, between the ages of 35 and 60 
years. There is evidence that one form of this type of Alzheimer's disease shows 
a tendency to run in families and is therefore known as familial Alzheimer's 
disease (FAD). 

In both types of Alzheimer's disease the pathology is the same but 
15 the abnormalities tend to be more severe and more widespread in cases beginning 
at an earlier age. The disease is characterized by two types of lesions in the brain, 
these are senile plaques and neurofibrillary tangles. 

Senile plaques are areas of disorganized neuropil up to 150/im 
across with extracellular amyloid deposits at the center. Neurofibrillary tangles 
2 0 are intracellular deposits of amyloid protein consisting of two filaments twisted 
about each other in pairs. 

The major protein subunit, 0-amyloid protein, of the amyloid 
filaments of the senile plaque is a highly aggregating small polypeptide of 
approximate relative molecular mass 4,500. This protein is a cleavage product of 
25 a much larger precursor protein called amyloid precursor protein (APP). 

At present there is no known effective therapy for the various forms 
of Alzheimer's disease (AD). However, there are several other forms of dementia 
for which treatment is available and which give rise to progressive intellectual 
deterioration closely resembling the dementia associated with Alzheimer's disease. 
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A diagnostic test for AD would therefore provide a valuable tool in the diagnosis 
and treatment of these other conditions, by way of being able to exclude 
Alzheimer's disease. It will also be of value when a suitable therapy does become 
available. 

5 Also important is the development of experimental models of 

Alzheimer's disease that can be used to define further the underlying biochemical 
events involved in AD pathogenesis. Such models could presumably be employed, 
in one application, to screen for agents that alter the degenerative course of 
Alzheimer's disease. For example, a model system of Alzheimer's disease could 
10 be used to screen for environmental factors that induce or accelerate the 
pathogenesis of AD. In contradistinction, an experimental model could be used 
to screen for agents that inhibit, prevent, or reverse the progression of AD. 
Presumably, such models could be employed to develop pharmaceuticals that are 
effective in preventing, arresting, or reversing AD. 

15 

SUMMARY OF THE INVENTION 
The present invention provides model systems of Alzheimer's 
disease, wherein the model system comprises a DNA sequence encoding an 
amyloid precursor protein (APP) isoform or fragment that has an amino acid other 
2 o than valine at the amino acid position corresponding to amino acid residue position 
717 of APP770. 

In a first embodiment, the present invention provides an isolated 
DNA sequence that encodes an amyloid precursor protein (APP) isoform or 
fragment that has an amino acid other than valine at the amino acid position 
25 corresponding to amino acid residue position 717 of APP770. 

In a second embodiment, the present invention provides a transgenic 
nonhuman animal that harbors at least one integrated copy of a human DNA 
sequence that encodes an amyloid precursor protein (APP) isoform or fragment 
that has an amino acid other than valine at the amino acid position corresponding 
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to amino acid residue position 717 of APP770. 

In a third embodiment, the present invention provides a transgenic 
nonhuman animal wherein at least one of the endogenous nonhuman APP alleles 
has been completely or partially replaced by all or a portion of a human APP gene 
5 that includes a codon 717 that does not encode valine. 

In a fourth embodiment, the present invention provides cells, 
typically mammalian cells and preferably mammalian cells of the neural, glial, or 
astrocytic lineage, that have been transformed or transfected with a heterologous 
DNA sequence, or have been derived from a transgenic nonhuman animal, wherein 
10 the cells express an amyloid precursor protein (APP) isoform or fragment that has 
an amino acid other than valine at the amino acid position corresponding to amino 
acid residue position 717 of APP770. In accordance with standard protocols, 
cultured human cells, either primary cultures or immortalized cell lines, may be 
transfected, either transiently or stably, with a mutant APP allele so that the 
15 cultured human cell expresses a mutant APP polypeptide. 

In a fifth embodiment, the present invention provides a method of 
producing transgenic nonhuman animals and transformed cells that contain a DNA 
sequence encoding an amyloid precursor protein (APP) isoform or fragment that 
has an amino acid other than valine at the amino acid position corresponding to 
20 amino acid residue position 717 of APP770. 

In a sixth embodiment, the present invention provides a method of 
producing, free from other human proteins, a human amyloid precursor protein 
(APP) isoform or fragment that has an amino acid other than valine at the amino 
acid position corresponding to amino acid residue position 717 of APP770. 
25 In a seventh embodiment, the present invention provides a human 

amyloid precursor protein (APP) isoform or fragment, free from other human 
proteins, that has an amino acid other than valine at the amino acid position 
corresponding to amino acid residue position 717 of APP770. 
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In an eighth embodiment, the invention provides a method for 
detecting an APP allele that is linked (i.e., cosegregates with) a genetic 
predisposition to Alzheimer's disease, particularly early onset AD, wherein such 
a pathognomonic APP allele is detected by determining that codon 717 of the allele 
does not encode valine. Preferably, a pathognomonic APP allele is detected when 
codon 717 is determined to encode either isoleucine, glycine, or phenylalanine. 
Thus, methods for locating the presence of genetic alterations associated with 
Alzheimer's disease are provided. This diagnostic method may be used to predict 
the development of the disease prior to onset, for genetic screening, or to detect 
a specific mutation in an experimental nonhuman animal or a cell. 

In a ninth embodiment, the invention provides a human variant APP 
polypeptide free of other human proteins, typically present in a cell of a nonhuman 
animal. The invention also relates to an isolated nucleic acid encoding such a 
polypeptide and to uses and applications of such nucleic acid as are described 
above in relation to the specific embodiment of the invention which involves an 
amino acid substitution at position 717 (as defined in relation to APP770). 

According to one aspect of the invention there is provided a method 
for detecting the presence, in a nucleic acid or other sample removed from a 
subject, of the gene for Alzheimer's disease comprising identifying a genetic 
alteration in a gene sequence coding for APP. Such genetic alterations may 
include mutations, insertions or deletions. 

BRIEF DESCRIPTION OF THE FIGURES 
Fig. 1 illustrates a first pedigree in which early onset AD is 
apparently inherited as an autosomal dominant disorder. The average age of onset 
in this family is 57±5 years. Black symbols denote affected individuals and 
oblique lines indicate individuals who are deceased. Females are denoted by 
circles and males by squares. Triangles are used in the present generation to 
preserve anonymity. In generation II the spouses of the two affected brothers were 
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sisters. Samples were available from the 13 individuals whose haplotypes are 
illustrated, from a further 19 children and spouses of these individuals and from 
7 more distantly related unaffected individuals. Beneath the pedigree are 
ideograms of the two chromosomes 21 in each individual of the third generation 
5 at four loci on the long arm of the chromosome. The linkage data suggest that the 
black chromosomes were inherited from the affected fathers. 

Fig. 2 shows an autoradiograph of a sequencing gel from part of 
exon 17 of the APP gene in a normal and an affected individual from the Fig. 1 
pedigree showing a single base pair change at base pair 2149 in the affected 
10 individual. This C to T transition leads to an amino acid substitution of a valine 
by an isoleucine at codon 717. 

Fig. 3 shows part of the amino acid sequence encoded by exons 16 
and 17 of the APP gene showing the mutation valine to isoleucine (V to I) within 
the transmembrane domain and the mutation causing HCHWA-D (E to Q) in the 
15 extracellular domain. The shaded region of the transmembrane domain and the 
boxed amino acids of the extracellular domain represent the sequence of the 
deposited jS-amyloid peptide. Adapted from Kang et al (1987) Nature 325:733. 

Fig. 4 shows Bell digests of the exon 17 PCR product from 
unaffected and affected individuals in an early onset AD family showing 
20 co-segregation of the restriction site and the disease. 

Fig. 5 shows the pedigree of family F19, together with D21S210 

data. 

Fig. 6 shows APP exon 17 sequences in an affected and unaffected 
member of F19. In the affected member there is a G- > T transition at position 2150. 
25 Fig. 7 shows the sequence of APP695. 

Fig. 8 shows the sequence of APP751. 
Fig. 9 shows the sequence of APP770. 



WO 92/13069 PCT/GB92/00123 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

The accumulation of 0-amyloid protein (A4) in particular brain 
regions is one of the main pathologic characteristics of Alzheimer's disease. The 
0-amyloid protein is an approximately 4 kD protein (39 to 42 amino acids) which 
5 is derived, as an internal cleavage product, from one or more isoforms of a larger 
amyloid precursor protein (APP). There are at least five distinct isoforms of APP 
containing 563, 695, 714, 751, and 770 amino acids, respectively (Wirak et al. 
(1991) Science 253:323). These isoforms of APP are generated by alternative 
splicing of primary transcripts of a single gene, designated the APP gene, which 

10 is located on human chromosome 21. It is known that most of the APP isoforms 
are glycosylated transmembrane proteins (Goldgaber et al. (1987) Science 
235:877), and that four of the isoforms, AA563, APP714, APP751 and APP770, 
have a protease inhibitor domain that is homologous to the Kunitz type of serine 
protease inhibitors. The /S-amyloid (A4) segment comprises approximately half of 

15 the transmembrane domain and approximately the first 28 amino acids of the 
extracellular domain of an APP isoform. 

Proteolytic processing of APP in vivo is a normal physiological 
process. Carboxy-terminal truncated forms of APP695, APP751, and APP770 are 
present in brain and cerebrospinal fluid (Palmert et al. (1989) Proc. Natl. Acad. 

20 Sci. U.S.A. 86:6338; Weidemann et al. (1989) Cell 57:115) and result from 
cleavage of the APP isoform at a constitutive cleavage site within the A4 peptide 
domain of an APP isoform (Esch et al. (1990) Science 248:1122). Normal 
proteolytic cleavage at the constitutive cleavage site yields a large (approximately 
100 kD) soluble, N-terminal fragment that contains the protease inhibitor domain 

25 in some isoforms, and a 9-10 kD membrane-bound, C-terminal fragment that 
includes most of the A4 domain. 

Generation of pathogenic /3-amyloid (A4) protein appears to be the 
result of aberrant or alternative proteolytic processing of APP, such that normal 
cleavage at the constitutive site within the A4 domain does not occur, but rather 
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cleavage occurs at two specific sites which flank the A4 domain. One of these 
aberrant cleavage sites is in the transmembrane domain and the other aberrant 
cleavage site is located approximately at the N-terminus of the first 28 amino acids 
of the extracellular domain (see Fig. 3). Such aberrant proteolytic cleavage 
5 produces the 0-amyloid A4 polypeptide which is prone to forming dense 
amyloidogenic aggregates that are resistant to proteolytic degradation and removal. 
The resultant 0-amyloid aggregates presumably are involved in the formation of 
the abundant amyloid plaques and cerebrovascular amyloid that are the 
neuropathological hallmarks of Alzheimer's disease. However, the exact aberrant 
10 cleavage sites are not always precise; jS-amyloid molecules isolated from the brain 
of a patient with AD show N- and C- terminal heterogeneity. Therefore, the 
aberrant cleavage pathway may involve either sequence-specific proteolysis 
followed by exopeptidase activity (creating end-heterogeneity), or may not be 
sequence-specific. 

15 The APP gene is known to be located on human chromosome 21. 

A locus segregating with familial Alzheimer's disease has been mapped to 
chromosome 21 (Hyslop et al. (1987) Science 235:885) close to the APP gene. 
Recombinants between the APP gene and the AD locus have been previously 
reported (Schellenberg et al. (1988) Science 241:1507). The data appeared to 

20 exclude the APP gene as the site of any mutation that might cause FAD (Van 
Broekhoven et al. (1987) Nature 329:153; Tanzi et al. (1987) Nature 329:156). 

Recombinant DNA technology provides several techniques for 
analyzing genes to locate possible mutations. For example, the polymerase chain 
reaction (Bell (1989) Immunology Today, 10:351) may be used to amplify specific 

25 sequences using intronic primers, which can then be analyzed by direct 
sequencing. 

Researchers working in the area of the hereditary cerebral 
haemorrhage with amyloidosis of the Dutch type ("HCHWA-D") (Levy et al 
(1990) Science 248:11224) found a substitution of Glu to Gin at residue 618 (using 
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the APP695 numbering system) in APP which is thought to result in the deposition 
of /3-amyloid in the cerebral vessels of these patients. The present inventors have 
identified a single base substitution, a C to T transition at base pair 2149, has been 
found in part of the sequence of the APP gene in some cases of familial 
5 Alzheimer's disease. This base pair transition leads to an amino acid substitution, 
i.e., valine to isoleucine at amino acid 717 (APP^o) (see Yoshikai et al. (1990) 
Gene ££257), close to the C-terminus of the /3-amyloid protein. This suggests that 
some cases of Alzheimer's disease are caused by mutations in the APP gene, and 
specifically mutations that change codon 717 such that it encodes an amino acid 

10 other than valine. 

Additionally, a further single base substitution, a T to G transition 
at adjacent base pair 2150, has been found in part of the sequence of the APP gene 
in other cases of familial Alzheimer's disease. This base pair transition leads to 
a different amino acid substitution, namely valine to glycine, at amino acid 717, 

15 thereby strengthening the argument that some cases of Alzheimer's disease are 
caused by mutations in the APP gene, specifically at codon 717. 

It is now clear that a mutation in the APP gene locus that results in 
a substitution of isoleucine for valine at codon 717 (residue 642 in APP695) gives 
rise to AD in some families (Goate et al. (1991) Nature 242:704). A second APP 

20 allelic variant wherein glycine is substituted for valine at codon 717 is now 
identified, and is so closely linked to the AD phenotype as to indicate that allelic 
variants at codon 717 of the APP gene, particularly those encoding an amino acid 
other than valine, and more particularly those encoding a isoleucine, glycine, or 
phenylalanine, are pathogenic and/or pathognomonic alleles (Chartier-Harlin et al. 

25 (1991) Nature 252:844). 

Proteolysis on either side of the /3-amyloid (A4) region of APP may 
be enhanced or qualitatively altered by the specific mutations at codon 717, 
increasing the rate of 0-amyloid deposition and aggregation. Such codon 717 
mutations may increase /3-amyloid formation by providing a poorer substrate for 
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the main proteolytic pathway (cleavage at the constitutive site) or a better substrate 
for a competing, alternative cleavage pathway (at aberrant cleavage sites). 

DEFINITIONS 

5 A number of terms and expressions are used throughout the 

specification and, to facilitate the understanding thereof, the following definitions 
are provided: 

As used herein, "exon" refers to any segment of an interrupted gene 
that is represented in the mature RNA product. 
10 As used herein, H intron" refers to a segment of an interrupted gene 

that is not represented in the mature RNA product. Introns are part of the primary 
nuclear transcript but are spliced out to produce mRNA, which is then transported 
to the cytoplasm. 

As used herein, the phrase "gene sequence coding for amyloid 
15 protein precursor" may be interpreted to mean the DNA and cDNA sequence as 
detailed by Yoshikai et al (1990) Gene £7:257 and Kang et al f loc. cit. . together 
with the promoter DNA sequence as described by Salbaum et al (1988) EMBO 
7(2}:2807. 

As used herein, the terms "label" or "labeled" refers to 
20 incorporation of a detectable marker (e.g., by incorporation of a radiolabeled 
nucleotide or by end-labeling with a terminal radiolabeled phosphate). DNA or 
RNA is typically labeled by incorporation of a radiolabeled nucleotide (H 3 , C 14 , 
S 35 , P 32 ) or a biotinylated nucleotide that can be detected by marked avidin (e.g., 
avidin containing a fluorescent marker or enzymatic activity) or digoxygeninylated 
25 nucleotide that can be detected by marked specific antibody. 

As used herein, "isoform", "APP", and "APP isoform" refer to a 
polypeptide that is encoded by at least one exon of the APP gene (Kitaguchi et al. 
(1988) Nature 221:530; Ponte et al., ML, p.525; R.E. Tanzi, McL, p.528; de 
Sauvage and Octave (1989) Science 245:651; Golde et al. (1990) Neuron 4:253). 
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An APP isoform may be encoded by an APP allele (or exon thereof) that is 
associated with a form of Alzheimer's disease or that is not associated with an AD 

disease phenotype. 

The term "0-amyloid gene H is used herein as a synonym for the APP 
5 gene, as 0-amyloid is a protein product produced by a post-translational cleavage 
of an APP gene product 

As used herein, "fragment" refers to a polypeptide of at least about 
9 amino acids, typically 50 to 75, or more, wherein the polypeptide contains an 
amino acid core sequence (listed in order from amino- to carboxy-terminal 

10 direction): 

-ne-Ala-Thr-Val-He-X-Ile-Thr-Leu- [SEQ ID NO:6] 
where X is any of the twenty conventional amino acids except valine, and 
particularly where X is isoleucine, glycine, or phenylalanine. A fragment may be 
a truncated APP isoform, modified APP isoform (as by amino acid substitutions, 

15 deletions, or additions outside of the core sequence), or other variant polypeptide 
sequence, but is a naturally-occurring APP isoform or 0-amyloid polypeptide 
that is present in a human individual, whether affected by AD or not. If desired, 
the fragment may be fused at either terminus to additional amino acids, which may 
number from 1 to 20, typically 50 to 100, but up to 250 to 500 or more. 

20 As used herein, " APP75 1 " and "APP770" refer, respectively, to the 

751 and 770 amino acid residue long polypeptides encoded by the human APP 
gene (Ponte et al. loc. cit .: Kitaguchi et al. loc. cit. : Tanzi et al. log, gi^)- 

As used herein, "codon 717" refers to the codon (i.e., the 
trinucleotide sequence) that encodes the 717th amino acid position in APP770, or 

25 the amino acid position in an APP isoform or fragment that corresponds to the 
717th position in APP770. For example but not limitation, a 670 residue long 
fragment that is produced by truncating APP770 by removing the 100 N-terminal 
amino acids has its 617th amino acid position corresponding to codon 717. In fact, 
as used herein, codon 717 refers to the codon that encodes the 698th amino acid 



WO 92/13069 PCT/GB92/00123 



11 

residue of APP751 [SEQ ID NO:2] and the 642nd amino acid residue of APP695 
[SEQ ID NO:l]. 

As used herein, "human APP isoform or fragment" refers to an APP 
isoform or fragment that contains a sequence of at least 9 consecutive amino acids 
5 that is identical to a sequence in a human APP770, APP751, or APP695 protein 
that occurs naturally in a human individual, and wherein an identical sequence is 
not present in an APP protein that occurs naturally in a nonhuman species. 

A nucleic acid is "operably linked" when it is placed into a 
functional relationship with another nucleic acid sequence. For instance, a 
10 promoter or enhancer is operably linked to a coding sequence if it affects the 
transcription of the sequence. With respect to transcription regulatory sequences, 
operably linked means that the DNA sequences being linked are contiguous and, 
where necessary to join two protein coding regions, contiguous and in reading 
frame. 

15 The term "corresponds to" is used herein to mean that a sequence 

is homologous (i.e., is identical, not strictly evolutionarily related) to all or a 
portion of a reference sequence. In contradistinction, the term "complementary 
to" is used herein to mean that the complementary sequence is homologous to all 
or a portion of a reference sequence. For illustration, the nucleotide sequence 

20 "TATAC" corresponds to a reference sequence "TATAC" and is complementary 
to a reference sequence "GTATA". 

The term "transcriptional enhancement" is used herein to refer to 
functional property of producing an increase in the rate of transcription of linked 
sequences that contain a functional promoter. 

25 The term "agent" is used herein to denote a chemical compound, a 

mixture of chemical compounds, a biological macromolecule, or an extract made 
from biological materials such as bacteria, plants, fungi, or animal (particularly 
mammalian) cells or tissues. Agents are evaluated for potential biological activity 
by inclusion in screening assays described hereinbelow. 
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As used herein, the term "mutant" refers to APP alleles having 
missense mutations that are pathognomonic for a genetic predisposition for 
developing AD; specifically a mutation at codon 717 (as referenced by the amino 
acid sequence in APP770) of the APP gene, such that codon 717 encodes one of 
5 the nineteen amino acids that are not valine (i.e., glycine, methionine, alanine, 
serine, isoleucine, leucine, threonine, proline, histidine, cysteine, tyrosine, 
phenylalanine, glutamic acid, tryptophan, arginine, aspartic acid, asparagine, 
lysine, and glutamine), but preferably isoleucine, glycine, or phenylalanine. Thus 
a mutant APP770 polypeptide is an APP770 polypeptide that has an amino acid 

10 residue at position 717 that is not valine. Other mutant APP isoforms comprise 
a non-valine amino acid at the amino acid residue position that corresponds to 
codon 717 (i.e., that is encoded by codon 717). Similarly, a mutant APP allele 
or a variant APP codon 717 allele is an APP allele that encodes an amino acid 
other than valine at codon 717 (referenced to the human APP770 deduced 

15 translation as described in the "codon 717" definition, supra), preferably 
isoleucine, glycine, or phenylalanine. Hence, an APP allele that encodes valine 
at codon 717 is a "wild-type" APP allele. 

It is apparent to one of skill in the art that nucleotide substitutions, 
deletions, and additions may be incorporated into the polynucleotides of the 

2 o invention. However, such nucleotide substitutions, deletions, and additions should 
not substantially disrupt the ability of the polynucleotide to hybridize to one of the 
polynucleotide sequences shown in Figs. 5 and 6 under hybridization conditions 
that are sufficiently stringent to result in specific hybridization. 

"Specific hybridization" is defined herein as the formation of hybrids 

25 between a probe polynucleotide (e.g., a polynucleotide of the invention which may 
include substitutions, deletion, and/or additions) and a specific target 
polynucleotide (e.g., a polynucleotide having the sequence ), wherein the probe 
preferentially hybridizes to the specific target such that, for example, a band 
corresponding to a variant APP allele or restriction fragment thereof, can be 



WO 92/13069 



PCT/GB92/00123 



13 

identified on a Southern blot, whereas a corresponding wild-type APP allele (L^, 
one that encodes valine at codon 717) is not identified or can be discriminated 
from a variant APP allele on the basis of signal intensity. Hybridization probes 
capable of specific hybridization to detect a single-base mismatch may be designed 
according to methods known in the art and described in Maniatis et aL, MQlggular 
ginning: A L^ratonr Manual . 2nd Ed., (1989), Cold Spring Harbor, N.Y. and 
Berger and Kimmei, Methods in En zvmologv. Volume 152. Guide to Molecular 
Pinning Techniques (1987), Academic Press, Inc., San Diego, CA; Gibbs et al. 
(1989) ^"rtric. Acids Res. 17:2437; Kwok et al. (1990) Nucleic Ac^S Res, 
1&999; Miyada et al. (1987) Methods Enzvmol. 154:94, each of which is 
incorporated herein by reference. The T m for oligonucleotides is calculated under 
standard conditions (1 M NaCl) to be [4°C x (G+C) + 2°C x (A+T)]. While 
the conditions of PCR differ from the standard conditions, this T m is used as a 
guide for the expected relative stabilities of oligonucleotides. Allele-specific 
primers are typically 13-15 nucleotides long, sometimes 16-21 nucleotides long, 
or longer; when short primers are used, such as a 14 nucleotide long primer, low 
annealing temperatures are used, typically 44 to 50°C, occasionally somewhat 
higher or lower depending on the base composition of the sequence(s). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Detection of Mutant Codon 717 APP Alleles 

In an embodiment of the invention, the method involves identifying 
a genetic alteration at amino acid 717, which may cause the consensus Val to be 
changed, for example, to another hydrophobic residue. This will generally be 
performed on a specimen removed from the subject. Hydrophobic residues include 
Leu, Ala, lie and Gly, the first three of which have aliphatic side chains. Phe also 
has a hydrophobic residue which may be appropriate. As indicated above, 
preferred residues include lie, Gly, and Phe (Murrell et al, (1991) Science 
254:97). 
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The fact that these mutations discussed above are at the same codon 
may be a coincidence, but this seems unlikely on statistical grounds. There are 
two possibilities that may explain these data. First, substitution of the valine 
residue at codon 717 may result in increased beta-amyloid deposition due to 
5 changes in APP metabolism. Secondly, the variation in the sequence around this 
position may result in increased translation of APP mRNAs and thus cause AD by 
a route analogous to that by which AD is believed to be caused in Down Syndrome 
(Tanzi and Hyman (1991) Nature 350:564 and Rumble et al. (1989) N. Engl J. 
Med. 320: 1446). In situ hybridization studies have shown that APP 717 mutations 

10 do not alter APP expression (Harrison et al. (1991) Neurorep. 2:152). 

The V717I (APP 717 Val->Ile), V717G (APP 717 
Val->Gly) and V717F (APP 717 Val->Phe) mutations would destabilise a 
putative stem loop structure and destroy a possible iron-responsive element 
between base pairs 2131 and 2156 (Tanzi and Hyman, loc. at.). There are several 

15 other possible mutations which could also disrupt this structure, many of which 
would be silent at the protein level; yet these mutations specifically referred to 
have involved a change to the same amino acid, and no silent changes or changes 
to other amino acids have been reported prior to the work described herein. 
Examination of sequence data from 10 other mammalian species (Johnstone et al. 

20 (1991) Mol Brain Res. 10:299) shows that while the valine residue at codon 717 
is conserved in all of them, the putative stem loop structure postulated from the 
human sequence (Tanzi and Hyman loc. cit.) would not be predicted to occur in 
either cattle or sheep; and in pig and mouse the consensus sequence for the 
iron-responsive elements is not present. Finally, such stem loop structures are 

25 believed to modulate gene translation by altering mRNA stability (Klausner and 
Harford (1989) Science 246:870); however, Harrison and colleagues (Harrison et 
al. loc. cit.) have shown by in situ hybridization that APP mRNAs are not grossly 
altered in the brain of an individual with the V717I mutation. For these reasons, 
it is believed likely that alterations in the rate of APP translation caused by the 
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specific mutations identified are not likely to be the key to their pathogenicity. 

The fact that the specific mutations discussed involve different 
changes (Val->Ile, Val->GIy, and Val->Phe) suggests that neither side-chain 
hydrophobicity nor side-chain bulk is the crucial issue. All examples of APP 
5 alleles that encode an amino acid other than valine at codon 717, cosegregate with 
FAD; suggesting that the valine that occurs at position 717 in wild-type APP770 
or APP751 is a critical amino acid residue for non-pathogenic APP proteolytic 
processing (i.e., by the constitutive cleavage pathway). 

The major metabolic pathway for the APP molecule involves 

10 cleavage within the beta-amyloid fragment (Esch et al. be. tit.). To generate 
beta-amyloid, there must be a second pathway in which APP is cleaved outside this 
sequence. Such a cleavage would be likely to leave a stub of the APP molecule 
containing the beta-amyloid fragment embedded in the membrane. Possibly, the 
beta-amyloid-containing fragment which is generated by the second pathway is 

15 degraded by peptidase action; the reported mutations may be pathogenic because 
peptides which contain them may be more resistant to the actions of this peptidase. 
Therefore, genetic alterations in the APP gene which result in altered (generally 
reduced) degradative properties are particularly important in the application of the 
invention. There are several methodologies available from recombinant DNA 

20 technology which may be used for detecting and identifying a genetic mutation 
responsible for Alzheimer's disease. These include direct probing, polymerase 
chain reaction (PCR) methodology, restriction fragment length polymorphism 
(RFLP) analysis and single strand conformational analysis (SSCA). 

Detection of point mutations using direct probing involves the use 

25 oligonucleotide probes which may be prepared synthetically or by nick translation. 
The DNA probes may be suitably labelled using, for example, a radiolabel, 
enzyme label, fluorescent label, biotin-avidin label and the like for subsequent 
visualization in for example a Southern blot hybridization procedure. The labelled 
probe is reacted with the sample DNA bound to a nitrocellulose or Nylon 66 
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substrate. The areas that carry DNA sequences complementary to the labelled 
DNA probe become labelled themselves as a consequence of the reannealling 
reaction. The areas of the filter that exhibit such labelling may then be visualized, 
for example, by autoradiography. 
5 Alternative probing techniques, such as ligase chain reaction (LCR) 

involve the use of mismatch probes, Lfi,, probes which have full complementarity 
with the target except at the point of the mutation. The target sequence is then 
allowed to hybridize both with oligonucleotides having full complementarity and 
oligonucleotides containing a mismatch, under conditions which will distinguish 

10 between the two. By manipulating the reaction conditions it is possible to obtain 
hybridization only where there is full complementarity. If a mismatch is present 
then there is significantly reduced hybridization. 

The polymerase chain reaction (PCR) is a technique that amplifies 
specific DNA sequences with remarkable efficiency. Repeated cycles of 

15 denaturation, primer annealing and extension carried out with a heat stable enzyme 
Taq polymerase leads to exponential increases in the concentration of desired DNA 
sequences. 

Given a knowledge of the nucleotide sequence encoding the 
precursor of amyloid protein of AD (Kang et al. loc. tit., and Yoshikai, above) 

20 it may be possible to prepare synthetic oligonucleotides complementary to 
sequences which flank the DNA of interest. Each oligonucleotide is 
complementary to one of the two strands. The DNA is then denatured at high 
temperatures (e.g., 95 *Q and then reannealed in the presence of a large molar 
excess of oligonucleotides. The oligonucleotides, oriented with their 3' ends 

25 pointing towards each other, hybridize to opposite strands of the target sequence 
and prime enzymatic extension along the nucleic acid template in the presence of 
the four deoxyribonucleotide triphosphates. The end product is then denatured 
again for another cycle. After this three-step cycle has been repeated several times, 
amplification of a DNA segment by more than one million fold can be achieved. 
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The resulting DNA may then be directly sequenced in order to locate any genetic 
alteration. Alternatively, it may be possible to prepare oligonucleotides that will 
only bind to altered DNA, so that PCR will only result in multiplication of the 
DNA if the mutation is present. Following PCR, allele-specific oligonucleotide 
5 hybridization (Dihella et al. (1988) Lancet 1:497) may be used to detect the AD 
point mutation. Alternatively an adaptation of PCR called amplification of specific 
alleles (PAS A) can be employed; this uses differential amplification for rapid and 
reliable distinction between alleles that differ at a single base pair. 

In yet another method PCR may be followed by restriction 

10 endonuclease digestion with subsequent analysis of the resultant products. The 
substitution of T for C at base pair 2149, found as a result of sequencing exon 17, 
creates a BcR restriction site. The creation of this restriction endonuclease 
recognition site facilitates the detection of the AD mutation using RFLP analysis 
or by detection of the presence or absence of a polymorphic Bell site in a PCR 

15 product that spans codon 717. 

For RFLP analysis, DNA is obtained, for example, from the blood 
of the subject suspected of having AD and from a normal subject is digested with 
the restriction endonuclease Bell and subsequently separated on the basis of size 
using agarose gel electrophoresis. The Southern blot technique can then be used 

20 to detect, by hybridization with labeled probes, the products of endonuclease 
digestion. The patterns obtained from the Southern blot can then be compared. 
Using such an approach, DNA spanning an Alzheimer's mutation that creates or 
removes a restriction site at codon 717, such as the Bell site, is detected by 
determining the number of bands detected and compa^ng this number to a 

25 reference allele that has a codon 717 allele that encodes valine. 

Correspondingly, the substitution of G for T at base pair 2150 
creates a SfaNl restriction site (GCATC), which may be exploited in a manner 
similar to that described above, mutatis mutandis. Similar creation of additional 
restriction sites by nucleotide substitutions within codon 717, wherein the codon 
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717 encodes an amino acid other than valine, can be readily calculated by 
reference to the genetic code and a list of nucleotide sequences recognized by 
restriction endonucleases fPromepa ProtQmk and Applications Guide. (1991) 
Promega Corporation, Madison, Wisconsin). 
5 Single strand conformational analysis (SSCA) (Orita et al. (1989) 

Genomics 5:874 and Orita et al. (1990) Genomics 6:271) offers a relatively quick 
method of detecting sequence changes which may be appropriate in at least some 
instances. 

PCR amplification of specific alleles (PASA) is a rapid method of 

10 detecting single-base mutations or polymorphisms (Newton et al. (1989) Nucleic 
AcjdiRes, 12:2503; Nichols etal. (1989) fisnflmifiS 5:535; Okayamaetal. (1989) 
j t j*h. Hin.Med. 114 :105: Sarkaretal. (1990) Anal. Bipchem, 186:64; Sommer 
etal. nQ8<r> lvTavn Clin. Proc. 64:1361: Wu (1989) Ptp<T. Nfrfl. A<*d. Sci, U.S.A. 
S&2757; and Dutton et al. (1991) Biotechnioues 11:700). PASA (also known as 

15 allele specific amplification) involves amplification with two oligonucleotide 
primers such that one is allele-specific. The desired allele is efficiently amplified, 
while the other allele(s) is poorly amplified because it mismatches with a base at 
or near the 3' end of the allele-specific primer. Thus, PASA or the related method 
of PAMSA may be used to specifically amplify one or more variant APP codon 

20 717 alleles. Where such amplification is done on genetic material (or RNA) 
obtained from an individual, it can serve as a method of detecting the presence of 
one or more variant APP codon 717 alleles in an individual. 

Similarly, a method known a ligase chain reaction (LCR) has been 
used to successfully detect a single-base substitution in a hemoglobin allele that 

25 causes sickle cell anemia (Barany et al. (1991) Pme. Natl. Acad. Sci. U.S.A. 
58:189; Weiss (1991) Science 254:1292). LCR probes may be combined, or 
multiplexed for simultaneously screening for multiple different mutations. Thus, 
one method of screening for variant APP codon 717 alleles is to multiplex at least 
two, and preferably all, LCR probes that will detect an APP allele having a codon 
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717 that does not encode valine, but that does encode an amino acid. The universal 
genetic code provides the degenerate sequences of all the encoded non-valine 
amino acids, thus LCR probe design for detecting any particular variant codon 717 
allele is straightforward, and multiplexed pools of such LCR probes may be 
5 selected in the discretion of a practitioner for his particular desired use. 

In performing diagnosis using any of the above techniques or 
variations thereof, it is preferable that several individuals are examined. These 
may include an unaffected parent, an affected parent, an affected sibling, an 
unaffected sibling as well as other perhaps more distant family members. 

10 

Model Animals and Cell Lines 

Having identified specific mutations in codon 717 of the jS-amyloid 

gene as a cause of familial Alzheimer's disease (FAD), it is possible, using genetic 

manipulation, to develop transgenic model systems and/or whole cell systems 
15 containing the mutated FAD gene (or a portion thereof) for use, for example, as 

model systems for screening for drugs and evaluating drug effectiveness. 

Additionally, such model systems provide a tool for defining the underlying 

biochemistry of APP and (8-amyloid metabolism, which thereby provides a basis 

for rational drug design. 
20 One type of cell system can be naturally derived. For this, blood 

samples from the affected subject must be obtained in order to provide the 

necessary cells which can be permanently transformed into a lymphoblastoid cell 

line using, for example, Epstein-Barr virus. 

Once established, such cell lines can be grown continuously in 
25 suspension culture and may be used for a variety of in vitro experiments to study 

APP expression and processing. 

Since the FAD mutation is dominant, an alternative method for 

constructing a cell line is to engineer genetically a mutated gene, or a portion 

thereof spanning codon 717, into an established (either stably or transiently) cell 
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line of choice. Sisodia (1990) Science 248:492) has described the insertion of a 
normal APP gene, by transfection, into mammalian cells. Oltersdorf et al. ((1990) 
/. Biol. Chem. 265:4492) describe the insertion of APP into immortalized 
eukaryotic cell lines. 

5 Baculovirus expression systems are useful for high level expression 

of heterologous genes in eukaryotic cells. Knops et al. (1991) /. Biol Chem. 
266(1 1):7285 describes the expression of APP using such a system. 

In yet a further use of the present method, it may be possible to 
excise the mutated gene <j.c, a variant APP codon 717 gene) for use in the 

10 creation of transgenic animals containing the mutated gene. For example, an 
entire human variant APP codon 717 allele may be cloned and isolated, either in 
parts or as a whole, in a cloning vector (e.g., XCharon35, cosmid, or yeast 
artificial chromosome). The human variant APP codon 717 gene, either in parts 
or in whole, may be transferred to a host nonhuman animal, such as a mouse. As 

15 a result of the transfer, the resultant transgenic nonhuman animal will preferably 
express one or more variant APP codon 717 polypeptides. Most preferably, a 
transgenic nonhuman animal of the invention will express one or more variant APP 
codon 717 polypeptides in a neuron-specific manner (Wirak et al. (1991) EMEQ 
1Q:289). This may be accomplished by transferring substantially the entire human 

2 o • APP gene (encoding a codon 717 mutant) including the 4.5 kilobase sequence that 
is adjacent to and upstream of the first major APP transcriptional start site. 

Alternatively, one may design minigenes encoding variant APP 
codon 717 polypeptides. Such minigenes may contain a CDNA sequence encoding 
a variant APP codon 717 polypeptide, preferably full-length, a combination of APP 

25 gene exons, or a combination thereof, linked to a downstream polyadenylation 
signal sequence and an upstream promoter (and preferably enhancer). Such a 
minigene construct will, when introduced into an appropriate transgenic host (e.g., 
mouse or rat), express an encoded variant APP codon 717 polypeptide, most 
preferably a variant APP codon 717 polypeptide that contains either an isoleucine, 
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glycine, or phenylalanine residue at codon 717 of APP770 or the corresponding 
position in an APP isoform or fragment. 

One approach to creating transgenic animals is to target a mutation 
to the desired gene by homologous recombination in an embryonic stem (ES) cell 
5 line in vitro followed by microinjection of the modified ES cell line into a host 
blastocyst and subsequent incubation in a foster mother (see Frohman and Martin 
(1989) Cell 5fr 145). Alternatively, the technique of microinjection of the mutated 
gene, or a portion thereof, into a one-cell embryo followed by incubation in a 
foster mother can be used. Various uses of transgenic animals, particularly 

10 transgenic animals that express a wild-type APP isoform or fragment, are disclosed 
in Wirak et al (1991) EMBO, 10(2):289; Schilling et al (1991) Gene 98(2):225; 
Quon et al (1991) Nature 252:239; Wirak et al (1991) Science 252:323; and 
Kawabata et al (1991) Nature 254:476. Additional methods for producing 
transgenic animals are known in the art. 

15 Alternatively, site-directed mutagenesis and/or gene conversion can 

be used to mutate a murine (or other nonhuman) APP gene allele, either 
endogenous or transfected, such that the mutated allele does not encode valine at 
the codon position in the mouse APP gene that corresponds to codon 717 (of 
APP770) of the human APP gene (such position is readily identified by homology 

20 matching of the murine APP gene or APP protein to the human APP gene or 
APP770 protein). Preferably, such a mutated murine allele would encode 
isoleucine or glycine or phenylalanine at the corresponding codon position. 

Therapeutics 

25 Having detected the genetic mutation in the gene sequence coding 

for /S-amyloid protein in an individual not yet showing overt signs of familial AD, 
using the method of the present invention, it may be possible to employ gene 
therapy, in the form of gene implants, to prevent the development of the disease. 

Additional embodiments directed to modulation of the production of 
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variant APP proteins include methods that employ specific antisense 
polynucleotides complementary to all or part of a variant APP sequence, or for 
some embodiments a wild-type APP sequence. Such complementary antisense 
polynucleotides may include nucleotide substitutions, additions, deletions, or 
5 transpositions, so long as specific hybridization to the relevant target sequence, 
i Ci > a variant APP codon 717 sequence, is retained as a property of the 
polynucleotide. Thus, an antisense polynucleotide must preferentially bind to a 
variant APP (i.e., codon 717 does not encode valine) sequence as compared to a 
wild-type APP (i.e., codon 717 does encode valine). It is evident that the 

10 antisense polynucleotide must reflect the exact nucleotide sequence of the variant 
allele (or wild-type allele where desired) and not a degenerate sequence. 

Complementary antisense polynucleotides include soluble antisense 
RNA or DNA oligonucleotides which can hybridize specifically to a variant APP 
MRNA species and prevent transcription of the mRNA species and/or translation 

15 of the encoded polypeptide (Ching et al. (1989) Proc, Natl, Anul , 5ci. U.S . A. 
S&10006; Broder et al. (1990) Ann. Tnt. Med. 112:604; Loreau et al. (1990) 
FEBS Letters 271:53-56); Holcenberg et al. W091/11535; U.S. No. 7,530,165 
("New human CRIPTO gene" - publicly available through Derwent Publications 
Ltd., Rochdale House, 128 Theobalds Road, London, UK); WO91/09865; 

20 WO91/04753; WO90/13641; and EP 386563, each of which is incorporated herein 
by reference). The antisense polynucleotides therefore inhibit production of the 
variant APP polypeptides. Antisense polynucleotides may preferentially inhibit 
transcription and/or translation of mRNA corresponding to a variant (or wild-type) 
polypeptides can inhibit T lymphocyte activation. 

25 Antisense polynucleotides may be produced from a heterologous 

expression cassette in a transfectant cell or transgenic cell or animal, such as a 
transgenic neural, glial, or astrocytic cell, preferably where the expression cassette 
contains a sequence that promotes cell-type specific expression (Wirak et al. loc. 
dr.). Alternatively, the antisense polynucleotides may comprise soluble 
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oligonucleotides that are administered to the external milieu, either in the culture 
medium in vitro or in the circulatory system or interstitial fluid in vivo . Soluble 
antisense polynucleotides present in the external milieu have been shown to gain 
access to the cytoplasm and inhibit translation of specific mRNA species. In some 
5 embodiments the antisense polynucleotides comprise methylphosphonate moieties. 
For general methods relating to antisense polynucleotides, see Antisense R NA and 
DNA . (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY). 

10 Mutant APP Antigens and Monoclonal Antibodies 

In yet another aspect of the invention, having detected a genetic 
alteration in a gene sequence coding for APP, it may be possible to obtain samples 
of the altered jS-amyloid protein from the same source. This protein may be 
derived from the brain tissue of a subject diagnosed as suffering from Alzheimer's 

15 disease, or more preferably are produced by recombinant DNA methods or are 
synthesized by direct chemical synthesis on a solid support. Such polypeptides will 
contain an amino acid sequence of an APP variant allele spanning codon 717. 
Examples of such sequences are: 

(a) -Ile-Ala-Thr-Val-Ile-Gly-Ile-Thr-Leu- [SEQ ID NO:7] 

20 (b) -ne-Ala-Thr-Val-De-Met-Ile-Thr-Leu- [SEQ ID NO: 8] 

(c) -ne-Ala-Thr-Val-Ile-Ala-Ile-Thr-Leu- [SEQ ID NO:9] 

(d) -ne-Ala-Thr-Val-ne-Ser-Ile-Thr-Leu- [SEQ ID NO: 10] 

(e) -ne-Ala-Thr-Val-Ile-Ile-De-Thr-Leu- [SEQ ID NO: 11] 

(f) -Ile-Ala-Thr-Val-Ile-Leu-Ile-Thr-Leu- [SEQ ID NO: 12] 
25 (g) -He-Ala-Thr-Val-Ile-Thr-Ile-Thr-Leu- [SEQ ID NO: 13] 

(h) -He-Ala-Thr-Val-Ile-Pro-Ile-Thr-Leu- [SEQ ID NO: 14] 

(i) -Ile-Ala-Thr-Val-Ile-His-Ile-Thr-Leu- [SEQ ID NO: 15] 
(j) -Ile-Ala-Thr-Val-Ile-Cys-Ile-Thr-Leu- [SEQ ID NO: 16] 
(k) -Ile-AIa-Thr-Val-Ile-Tyr-Ile-Thr-Leu- [SEQ ID NO: 17] 
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(I) -De-AIa-Thr-Val-Ile-Phe-IIe-Thr-Leu- [SEQ ID NO: 18] 
(m) -ne-Ala-Thr-Val-ne-GIu-ne-Thr-Leu- [SEQ ID NO: 19] 

(n) -ne-Ala-Thr-Val-Ile-Trp-ne-Thr-Leu- [SEQIDNO:20] 

(o) -He-AIa-Thr-Val-Ile-Arg-ne-Thr-Leu- [SEQ ID NO:21] 

(p) -ne-Ala-Thr-Val-Iie-Asp-ne-Thr-Uu- [SEQIDNO:22] 

(q) -Ile-Ala-Thr-Val-Ile-Asn-ne-Thr-Leu- [SEQ ID NO:23] 

(r) -ne-Ala-Thr-Val-ne-Lys-Ile-Thr-Leu- [SEQ ID NO:24] 

(s) -ne-Ala-Thr-Val-Ile-Gln-Ile-Thr-Leu- [SEQ ID NO:25] 

Using such polypeptide material it may then be possible to prepare 
antisera and monoclonal antibodies using, for example, the method of Kohler and 
Milstein ((1975) Nature 256:495). Such monoclonal antibodies could then form 
the basis of a diagnostic test. 

Such variant APP polypeptides may be used to immunize an animal 
15 for the production of specific antibodies. These antibodies may comprise a 
polyclonal antiserum or may comprise a monoclonal antibody produced by 
hybridoma cells. For general methods to prepare antibodies, see Antibodies; A 
juratory Manual . (1988) E. Harlow and D. Lane, Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, which is incorporated herein by reference. 
20 For example but not for limitation, a recombinantly produced 

fragment of a variant APP codon 717 polypeptide can be injected into a mouse 
along with an adjuvant so as to generate an immune response. Murine 
immunoglobulins which bind the recombinant fragment with a binding affinity of 
at least 1 x 10 7 M* 1 can be harvested from the immunized mouse as an antiserum, 
25 and may be further purified by affinity chromatography or other means. 
Additionally, spleen cells are harvested from the mouse and fused to myeloma cells 
to produce a bank of antibody-secreting hybridoma cells. The bank of hybridomas 
can be screened for clones that secrete immunoglobulins which bind the 
recombinantly produced fragment with an affinity of at least 1 x 10 6 M" 1 . More 
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specifically, immunoglobulins that bind to the variant APP codon 717 polypeptide 
but have limited crossreactivity with a wild-type (i.e., codon 717 encodes valine) 
APP polypeptide are selected, either by preabsorption with wild-type APP or by 
screening of hybridoma cell lines for specific idiotypes that preferentially bind the 
5 variant as compared to the wild-type. 

The nucleic acid sequences of the present invention capable of 
ultimately expressing the desired variant APP polypeptides can be formed from a 
variety of different polynucleotides (genomic or cDNA, RNA, synthetic 
oligonucleotides, etc.) as well as by a variety of different techniques. 

!0 As stated previously, the DNA sequences will be expressed in hosts 

after the sequences have been operably linked to (Ljl., positioned to ensure the 
functioning of) an expression control sequence. These expression vectors are 
typically replicable in the host organisms either as episomes or as an integral part 
of the host chromosomal DNA. Commonly, expression vectors will contain 

15 selection markers, e.g., tetracycline resistance or hygromycin resistance, to permit 
detection and/or selection of those cells transformed with the desired DNA 
sequences (s££, U.S. Patent 4,704,362, which is incorporated herein by 
reference). 

Polynucleotides encoding a variant APP codon 717 polypeptide may 
2 o include sequences that facilitate transcription (expression sequences) and translation 

of the coding sequences, such that the encoded polypeptide product is produced. 

Construction of such polynucleotides is well known in the art and is described 

further in Maniatis et al. Molecular Cloning: A La boratory Manual. 2nd Ed. 

(1989), Cold Spring Harbor, N.Y. For example, but not for limitation, such 
25 polynucleotides can include a promoter, a transcription termination site 

(polyadenylation site in eukaryotic expression hosts), a ribosome binding site, and, 

optionally, an enhancer for use in eukaryotic expression hosts, and, optionally, 

sequences necessary for replication of a vector. 
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£. coii is one prokaryotic host useful particularly for cloning the 
DNA sequences of the present invention. Other microbial hosts suitable for use 
include bacilli, such as Bacillus SU&fc and other enterobacteriaceae, such as 
Salmonella . Serratia. and various Pseydprnpnag species. In these prokaryotic 
5 hosts, one can also make expression vectors, which will typically contain 
expression control sequences compatible with the host cell (si, an origin of 
replication). In addition, any number of a variety of well-known promoters will 
be present, such as the lactose promoter system, a tryptophan (trp) promoter 
system, a beta-lactamase promoter system, or a promoter system from phage 

10 lambda. The promoters will typically control expression, optionally with an 
operator sequence, and have ribosome binding site sequences and the like, for 
initiating and completing transcription and translation. 

Other microbes, such as yeast, may also be used for expression. 
Sareharomvces is a preferred host, with suitable vectors having expression control 

15 sequences, such as promoters, including 3-phosphoglycerate kinase or other 
glycolytic enzymes, and an origin of replication, termination sequences and the like 
as desired. 

In addition to microorganisms, mammalian tissue cell culture may 
also be used to express and produce the polypeptides of the present invention (sss, 
Winnacker, "From Genes to Clones," VCH Publishers, N.Y., N.Y. (1987), which 
is incorporated herein by reference). Eukaryotic cells are actually preferred, 
because a number of suitable host cell lines capable of secreting intact human 
proteins have been developed in the art, and include the CHO cell lines, various 
COS cell lines, HeLa cells, myeloma cell lines, Jurkat cells, etc. Expression 
vectors for these cells can include expression control sequences, such as an origin 
of replication, a promoter, an enhancer (Queen et al. (1986) Tmmunpl, Rev, 3g:49, 
which is incorporated herein by reference), and necessary processing information 
sites, such as ribosome binding sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferred expression control sequences are 
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promoters derived from immunoglobulin genes, SV40, Adenovirus, Bovine 
Papilloma Vims, and the like. The vectors containing the DNA segments of 
interest (e.g.. polypeptides encoding a variant APP polypeptide) can be transferred 
into the host cell by well-known methods, which vary depending on the type of 
5 cellular host. For example, calcium chloride transfection is commonly utilized for 
prokaryotic cells, whereas calcium phosphate treatment or electroporation may be 
used for other cellular hosts. (See, generally . Maniatis, et al. Molecular Cloning: 
A Laboratory Manual . Cold Spring Harbor Press, (1982), which is incorporated 
herein by reference.) 

10 Alternatively, homologous recombination may be used to insert an 

APP mutant sequence into a host genome at a specific site, for example, at a host 
APP locus. In one type of homologous recombination, one or more host 
sequence(s) are replaced; for example, a host APP allele (or portion thereof) is 
replaced with a mutant APP allele (or portion thereof). In addition to such gene 

15 replacement methods, homologous recombination may be used to target a mutant 
APP allele to a specific site other than a host APP locus. Homologous 
recombination may be used to produce transgenic non-human animals and/or cells 
that incorporate mutant APP alleles. 

The method lends itself readily to the formulation of test kits which 

20 can be utilized in diagnosis. Such a kit would comprise a carrier being 
compartmentalized to receive in close confinement one or more containers wherein 
a first container may contain suitably labelled DNA probes. Other containers may 
contain reagents useful in the localization of the labelled probes, such as enzyme 
substrates. Still other containers may contain a restriction enzyme (such as BcR), 

25 buffers and the like, together with instructions for use. 

EXPERIMENTAL EXAMPLES 
The following examples are provided for illustration and are not 
intended to limit the invention to the specific example provided. 
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fiVAMPT.K 1 - Potion of a V al->ne mutation in the fl-amvloid (APP) gene 

The segregation of AD and markers along the long arm of 
chromosome 21 in a single family with autopsy-confirmed Alzheimer's disease (see 
Fig. 1) were examined. DNA samples were available from a total of six affected 
5 and 33 unaffected and at risk individuals. 

The APP gene in an affected family member was analyzed by 
polymerase chain reaction (PCR) direct sequencing using intronic primers 
(Gyllensten, U. in PCR Technology, Ed. Erlich, H.A., Stockton Press, 45-60, 
1989; Yoshikai et al. (1990) Gene 87:257). (Fig. 2). The primers were made 
10 according to the manufacturer's protocol using a Gene Assembler Plus (Pharmacia 
LKB). 

PCR was carried out using the following intronic primers in order 
to amplify exon 17 of the APP gene: 

15 [A] 5'-CCTCATCCAAATGTCCCCGTCATT-3' [SEQ ID NO:26] AND 

[B] 5'-GCCTAATTCTCTCATAGTCTTAATTCCCAC-3' [SEQ ID NO 27] 

PCR conditions were 94 °C for lOmin to denature; then 35 cycles of 60° C for 
lmin, 72°C for 3min, 94°C for 1.5min; and a single cycle of 72°C for lOmin. 
20 The reaction was carried out using lOmM tris-HCl pH 8.3, 50mM potassium 
chloride, 0.01% gelatin, 1.5mM magnesium chloride, 200/iM of dNTPs, 50 
pmoles of each PCR primer and 1 unit of Taq polymerase. The total final reaction 

volume was 25/d. 

A second PCR reaction was then performed with a final 
25 concentration of 50 pmol of primer [A] and 0.5 pmol of primer [B]. The PCR 
product was purified on a centricon 100 microconcentrator (Amicon) and used 
directly for sequencing with the SEQUENASE kit (version 2.0, United States 
Biochemical Corp.; the word SEQUENASE is a trade mark) following the 
manufacturer's protocol. 
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Exon 17 was sequenced first because it encodes part of the 
0-amyloid peptide and is the site of the mutation (at APP693) leading to Hereditary 
Cerebral Haemorrhage with Amyloidosis-Dutch Type (HCHWA-D). 

Sequencing of exon 17 revealed a C to T transition at base pair 
5 2149, causing a valine to isoleucine change at amino acid 717 (Fig. 2 and Fig. 3). 

This C to T transition creates a Bell restriction site enabling 
detection within the PCR product (Fig. 4). Bell digests were carried out at 50°C 
for 2-4 hours, as recommended by the manufacturer, then electrophoresed in 3% 
agarose* 

10 Screening by PCR of 100 unrelated, normal individuals and 14 cases 

(9 families) of familial late onset disease failed to demonstrate this substitution. 
Screening of 1 1 (9 families) cases of early onset familial disease revealed the BcE 
restriction site in two affected individuals from an unrelated family. The genetic 
data show that the disease loci are linked to the missense mutation. Also, failure 

15 to detect this polymorphism in 200 normal chromosomes supports the contention 
that it is a pathogenic mutation. 

The valine to isoleucine substitution occurs within the 
transmembrane domain two residues from the C-terminus of the 0-amyloid peptide. 
Computer analysis predicts that the substitution makes the transmembrane more 

20 hydrophobic and might thus anchor the protein more firmly within the membrane. 
The position of the substitution, two residues from the C-terminus of the 0-amyloid 
peptide may be of significance to the origin of the deposited peptide. This finding 
links Alzheimer's disease to HCHWA-D, a disease in which amyloid deposition 
is due to a mutation closer to the N-terminus but withir the /3-amyloid peptide 

25 (Levy et al. loc. eft.)- 
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F r Y AMPLE ? f - Preparation nf a cell lin e containing a defective g-amylc-id (AFP) 
gene 

10ml of fresh blood are collected from each individual suffering 
from familial Alzheimer's disease. Lymphocytes are purified from the blood on a 
Percoll gradient and mixed with Epstein-Barr virus (EBV). The cells are then 
plated out in medium supplemented with 10% foetal calf serum, antibiotics, 
glutamine and Cyclosporin A to kill the T lymphocytes. B lymphocytes which are 
infected by EBV become immortalized and establish a permanent cell line derived 
from the B cells of the patient. 

A lymphoblastoid cell line, AC21, has been deposited with the 
European Collection of Animal Cell Cultures, Porton Down. 



15 



20 



F XAMPT.F. 3 - Petition nf a Val - >niv mutation in the g-amvloid fAPP) gene 
A pedigree, designated F19 and shown in Fig. 5, which has 
autopsy-confirmed AD with an onset age of 59+4 years was identified by 
observing that an allele of the highly polymorphic dinucleotide repeat marker 
GT12 (D21S210), which is located close to the APP gene, co-segregated with the 
disease. Linkage analysis gave a peak lod score between the marker and the 
disease of 3.02 at a recombination fraction of zero, as the following table shows: 
Theta 0 0.01 0.05 0.1 0.2 0.3 0.4 
Lod 3.02 2.97 2.75 2.47 1.86 1.22 0.6 

Lod scores were calculated with seven liability classes modelling 
age-dependent penetrances from 0.01 to 0.95 with a phenocopy rate of 0.001 and 
25 a gene frequency of 0.001 using MLINK from the LINKAGE package (Lathrop 
et al. (1984) Proc. Natl. Acad. Sci. USA 81:3443). 

APP exon 17 sequences in an affected and an unaffected member of 
F19 were determined. In the affected member, there was a G->T transition at 
position 2150, as can be seen from Fig. 6. 
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The amplification of exon 17 was performed as described in 
Example 1 above and Chartier-Harlin et al (1991) Neurosci. Letts. 129:134, with 
the following modifications: (a) the amplification primer sequences were: 

ATA-ACC-TCA-TCC-AAA-TGT-CCC-C [SEQ ID NO:28] and 
5 GTA-ACC-CAA-GCA-TCA-TGG-AAG-C [SEQ ID NO:29]; and 

(b) the PCR conditions were 94°C/10 minutes then 35 cycles of 60°C/1 minute, 
72°C/1 minute, 94°C/1 minute, followed by 72°C/5 minutes. 

50 pmol of the second primer were used to generate single stranded 
product, which was then purified (Chartier-Harlin et al loc. cit.). The purified 
10 product was sequenced with the SEQUENASE kit (2.0) (Trade mark; USB) using 
a primer of sequence: 

AAA-TGA-AAT-TCT-TCT-AAT-TGC-G [SEQ ID NO:30]. 

The presence of the T->C transition creates gel artefacts which 
were resolved by the inclusion of inosine (SEQUENASE kit) in the sequencing 
15 reaction. 



Direct sequencing of exons 7 and 16 from affected individuals from F19 (Chartier- 
Harlin et al. loc. cit.) shows that these were of normal sequence and SSCA (Orita 
et al loc. ciu) and Orita et al.) failed to identify changes in exons 2, 3, 7, 9, 12, 
20 13 or 15. SSCA of exon 17 detects both APP693 (Levy, et al loc. cit. and Hardy 
et al (1991) Lancet 337:1342-1343) and APP717 Val->Ile under standard 
screening conditions and, when modified APP717 Val->Gly. 



EXAMPLE 4 - Production of Transgenic Animals with Mutant APP Allele 
25 Generation of the constructs : The vector plink was constructed by 

cloning polylinker between the PvuII and EcoRI sites of pBR322 such that the 
Hindin end of the polylinker was adjacent to the PvuII site. The ligation 
destroyed both the EcoRI and PvuII sites associated with the pBR322 segments. 
The 700bp Hpal to EcoRI fragment of pSV2neo (Southern and Berg (1982) L 
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jjtn[ Apnl. Genet. 1:327) that contains the SV40 polyadenylation signal was 
cloned into the Hpal to EcoRl sites of plink to generate pNotSV. The 200 bp 
Xhol to PstI fragment of pL2 containing the SV40 16S/lgS splice site (Okayama 
and Berg (1983) Mnl. Cell Biol. 3:280) was isolated, blunted with Klenow, then 
cloned into the Hpal site of pNotSV to generate pSplice. The 2.3kb Nrul to Spel 
fragment of pAPP695 containing the coding region of the cDNA for APP (Tana 
et al. (1987) Science 235:880) was cloned into the Nrul to Spd site of pSplice to 
generate pd695. The same strategy was used to generate pd751 using the cDNA 
for APP751 CTanzi et al. (1988) Nature 331:528). A variety of promoters have 
been inserted into the pd695 and pd751 vectors by using the unique Nrul or the 
Hindm and Nrul sites. 



Generation of pshAPP695 ft nshAPP751 : The construct 
pAmyproBam was generated by cloning the 1.5kb BamHI fragment of the APP 
cDNA into the BamHI site of pucl9 xHamy. The 700 bp Hindm to Asp718 
fragment of the pAmyproBam (similar to the 700 bp BamHI to Asp718 fragment 
described in Salbaum et al. (1988) EMBO 7:2807) was cloned into the Hindm to 
Asp718 sites of pd695 and pd751 to yield pshAPP695 and pshAPP751. 

P APPffl5 and pAPP751 : The pAPP695 and pAPP751 vectors were 
generated by a three-way ligation of the 3.0 kb EcoRI to Xhol fragment of 
pAmyProBam, the 1.5kb Xhol to Spel fragment of APP751cDNA, and the Spel 
to EcoRI site of pd751. 



Generation of pNSE751f+47) : The pNSE751 (+47) was 
constructed using a three-way ligation of the Hindffl to Kpnl fragment of pNSE6 
(Forss-Pettererc/. (1990) Neuron 5:187). The Kpnl to BstYl fragment of pNSE6 
and a partial BamHI (-47nt relative to the ATG) to Hindlll fragment of pAPP751 . 
This resulted in the generation of a Kpnl fragment that was cloned into the Kpnl 
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sites of pNSE751(+47). The BstYl/Bam fusion results in the loss of both sites. 

Generation of pNSE751 : This vector was generated using a four 
primer two-step PCR protocol (Higuchi et al. (1988) NucL Acids Res. 16:7351) 
5 that resulted in a direct fusion of the NSE initiation codon to the APP coding 
region. Oligonucleotides C2, 1072, 1073, and A2 (see Nucleotide Sequences, 
infra.) were used to generate a PCR product. The Kpnl fragment was generated 
by digestion with the restriction enzyme. The Kpnl fragment was used to replace 
a similar fragment in pNSE751(+47). 

10 

Generation of pNSE751-Hardv and pNSE751 -Dutch : The Hardy 
(APP642 Val->Ile of APP695) and Dutch (APP618 Gln->Glu of APP695) 
mutations were introduced using a four primer two-PCR protocol. Both sets of 
reactions used the same "outside primers" with the "inside primers" containing the 
15 appropriate mutations. This resulted in the generation of Bgin to Spel fragment 
after digestion, that contained either the Dutch or the Hardy mutation. The Bgin 
to Spel fragment of pNSE751 was replaced by the mutated fragment to generate 
the appropriate vector. The presence of the mutation was conformed by sequence 
analysis of the vectors. 

20 

Generation of pNSE751-Hardv and pNSE751 -Dutch : The Hardy VI 
(APP642 V to I), Hardy VG (APP642 V to G), and Dutch (APP618 E to Q) 
mutations were introduced using the four primer two-step PCR protocol (Higuchi 
et al (1988)). The Hardy VI mutant was generated using primers 117/738, 922, 
25 923, and 785; Hardy VG mutant was generated using primers 117/738, 1105, 
1106, and 785; Dutch mutant was generated using primers 117/738, 1010, 1011, 
and 785. In all these mutations the 700 bp Bglll to Spel fragment was isolated by 
digestion of the PCR product with the restriction enzymes, then cloned into the 
same sites of pNSE751. The mutations were confirmed by sequence analysis. 
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ffrPffffl tinn ofpNFH751 : The human NFH gene (Lees et al. (1988) 
EMBO 7(7): 1947) was isolated from a genomic library using a rat NFH cDNA as 
a probe (Lieberburg et al. (1989) Proc. Natl. Acids. Res. USA 86:2463). An SstI 
fragment was subcloned into the pSK vector. A pair of PCR primers was 
5 generated to place a Nrul site at the 3' end of the 150 bp amplified fragment 
immediately upstream of the initiation codon of the NFH gene. The 5' end 
contains a Kpnl site 150nt upstream of the initiation codon. The final construction 
of pNFH751 was generated by a three-way ligation of the 5.5b Hindm to Kpnl 
fragment of pNFH8.8, the Kpnl to Nrul PCR generated fragment, and the Hindm 

10 to Nrul fragment of pd751 . The sequence surrounding the PCR generated fusion 
at the initiation codon was confirmed by sequence analysis. The Dutch and Hardy 
variants of pNFH751 were generated by substitution of the 600 bp BglH to Spel 
fragment from a sequence confirmed mutated vector for the same fragment of 
pNFH751. The presence of the mutation was confirmed by hybridization with the 

15 mutated oligomer or by sequence analysis. 

feneration of r>Thv751 : The pThy751 vector was generated by a 
three-way ligation. The Hindm to BamHI fragment of pThy8.2 which was 
isolated from a human genomic library (Chang et al. (1985) Proc. Natl. Acad. Sci. 
20 USA 82:3819), the synthetic fragment ThyAPP, and the Hindm to Nrul fragment 
ofpd751. 
ThyAPP: 

CAGACTGAGATCCCAGAACCCTAGGTCTGACTCTAGGGTCTTGG[SEQ 
ID NO:31] 



25 



f eneration ofnThvClOO : This pThyClOO construct was generated 
by a three-way ligation. The 3.6kb Hindin to BamHI fragment of pThy8.2, the 
synthetic fragment ThyAPP2, and the Hindlll to BgHI fragment of pd751 or 
pNSE751 Dutch or pNSE751 Hardy were ligated to yield pThyClOO. 
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ThyAPP2: 

CAGACTGAGATCCCAGAACCGATCCTAGGTCTGACTCTAGGGTCTTGG 

[SEQ ID NO:32] 

The region around the initiation codon was confirmed by sequence 

5 analysis. 

preparation of DNA for injection : The transgene for injection was 
isolated from the corresponding vector of interest for digestion with NotI and gel 
electrophoresis. The transgene was purified by using the Gene Clean kit (BiolOl), 
10 then further purified on an Elutip or HPLC purified on a Nucleogen 4000 column. 

Microinjection : The transgene was injected at 2-20 mcg/ml into the 
most convenient pronucleus (usually the male pronucleus) of FVB or B6D2F2 one- 
cell embryos (Manipulating the Mouse Embryo, B. Hogan, F. Constantini, E. 

15 Lacy, Cold Spring Harbor . 1986). The injected embryos were cultured overnight. 

Embryos that split to the two-cell stage were implanted into pseudo-pregnant 
female CD1 mice. The mice were weaned at approximately 21 days. Samples of 
DNA obtained from tail biopsy were analyzed by Southern blot using a transgene 
specific probe (usually the SV40 3's splice and polyadenylation signal sequences). 

2 0 Transgenic mice harboring at least one copy of the transgene were identified. 

Use of Transgenic Mice: A mouse that expresses the hAPP gene 
or its variants can be used to test the pathogenesis of amyloid deposition and 
therapeutic intervention designed to modulate amyloid deposition. 

Biochemical analysis of the transgenic mice reveals possible 

25 intermediates in the catabolism of APP that are likely precursors to beta-amyloid. 

This analysis can be carried out in the animal or in primary tissue culture of the 
expressing cells. 

The animal can be used to test potential therapeutic agents. The test 
group of mice is treated with the test compound administered in an appropriate 
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fashion for a set period. At the conclusion of the test period, the animals are 
assessed behaviourally, biochemically, and histologically for any possible effects 
of the test compound. The exact protocol depends on the anticipated mechanism 
of action of the test compound. Compounds that may have utility in treating AD 
5 can be identified using this approach. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 



APPLICANT: 

(A) NAME: Imperial College of Science, Technology 6 

Medicine (not US) 

(B) STREET: Sherfield Building, Exhibition Road, 

(C) CITY: London 

(E) COUNTRY: GB 

(F) POSTAL CODE (ZIP) : SW7 2AZ 



(A 
(B 
(C 
(E 
(F 

(A 
(B 
(C 
(D 
(E 
(F 

(A 
(B 

(C 
(D 
(E 
(F 

(A 
(B 
(C 
(E 
(F 

(A 
(B 
(C 
(D 
(E 



NAME: HARDY, John Anthony (US only) 
STREET: 187 Drakefell Road 
CITY: London 
COUNTRY: GB 

POSTAL CODE (ZIP) : SE4 

NAME: GOATE, Alison Mary (US only) 
STREET: 100 High Street, Hampton Wick, 
CITY : Kingston-on-Thames 
STATE: Surrey 
COUNTRY: GB 

POSTAL CODE (ZIP) : KT1 4DQ 

NAME: MULLAN, Michael John (US only) 

STREET: Suncoast Gerontology Ctr, 12901 Bruce B. 

Downs Blvd. MDC 50, 
CITY : Tampa 
STATE: Florida 
COUNTRY: US 

POSTAL CODE (ZIP) : 33612 

NAME: CHARTIER-HARLIN, Marie-Christine (US only) 
STREET: 63 Francis Road 
CITY : London 
COUNTRY: GB 

POSTAL CODE (ZIP) : E10 6PN 

NAME: OWEN, Michael John (US only) 

STREET: Four Hedges, Castlehill, LLanblethian, 

CITY : Cowbr idge 

STATE: South Glamorgan 

COUNTRY: GB 



(ii) TITLE OF INVENTION: Test and Model for Alzheimer's Disease 

(iii) NUMBER OF SEQUENCES: 44 

(iv) COMPUTER READABLE FORM: 
Not Applicable 



(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: WO PCT/GB92/ 
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(2) INFORMATION FOR SEQ ID NO:l: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 9° 9 

Trp cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 

* 100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 I 25 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
* 165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 I 85 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 
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Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 



520 



525 
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Val Glu Leu Leu Pro 
530 

Pro Trp His Ser Phe 
545 

Glu Val Glu Pro Val 
565 



Thr Arg Pro Gly Ser 
580 

Glu Val Lys Met Asp 
595 

His His Gin Lys Leu 
610 

Gly Ala He He Gly 
625 

He Val He Thr Leu 
645 

His His Gly Val Val 
660 

His Leu Ser Lys Met 
675 

Phe Phe Glu Gin Met 
690 



Val Asn Gly Glu Phe Ser 
535 

Gly Ala Asp Ser Val Pro 
550 555 

Asp Ala Arg Pro Ala Ala 
570 

Gly Leu Thr Asn He Lys 
585 

Ala Glu Phe Arg His Asp 
600 

Val Phe Phe Ala Glu Asp 
615 

Leu Met Val Gly Gly Val 
630 635 

Val Met Leu Lys Lys Lys 
650 

Glu Val Asp Ala Ala Val 
665 

Gin Gin Asn Gly Tyr Glu 
680 

Gin Asn 
695 



Leu Asp Asp Leu Gin 
540 

Ala Asn Thr Glu Asn 
560 



Asp Arg Gly Leu Thr 
575 

Thr Glu Glu He Ser 
590 

Ser Gly Tyr Glu Val 
605 

Val Gly Ser Asn Lys 
620 

Val He Ala Thr Val 
640 



Gin Tyr Thr Ser He 
655 

Thr Pro Glu Glu Arg 
670 

Asn Pro Thr Tyr Lys 
685 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 HO 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 



210 



220 
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Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 Z4U 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 

275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met lie 
290 295 300 

Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 

325 330 335 

Cys Met Ala Val Cys Gly Ser Ala He Pro Thr Thr Ala Ala Ser Thr 
340 345 350 

Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp Glu Asn Glu 

355 360 365 

His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala Lys His Arg 
370 375 380 

Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala Glu Arg Gin 
385 390 395 400 

Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He Gin His Phe 
405 410 415 

Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn Glu Arg Gin 
420 425 430 

Gin Leu Val Glu Thr His Met Ala Arg Val Glu Ala Met Leu Asn Asp 
435 440 445 

Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu Gin Ala Val 
450 455 460 

Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys Tyr Val Arg 
465 470 475 480 

Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe Glu His Val 
485 490 495 

Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser Gin Val Met 
500 505 510 

Thr His Leu Arg Val He Tyr Glu Arg Met Asn Gin Ser Leu Ser Leu 
515 520 525 
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Leu Tyr Asn Val Pro 
530 

Glu Leu Leu Gin Lys 
545 

Met He Ser Glu Pro 
565 

Ser Leu Thr Glu Thr 
580 

Glu Phe Ser Leu Asp 
595 

Ser Val Pro Ala Asn 
610 

Pro Ala Ala Asp Arg 
625 

Asn He Lys Thr Glu 
645 

Arg His Asp Ser Gly 
660 

Ala Glu Asp Val Gly 
675 

Gly Gly Val Val He 

690 

Lys Lys Lys Gin Tyr 
705 

Ala Ala Val Thr Pro 
725 

Gly Tyr Glu Asn Pro 
740 



Ala Val Ala Glu Glu He Gin Asp Glu Val Asp 
535 540 

Glu Gin Asn Tyr Ser Asp Asp Val Leu Ala Asn 
550 555 560 

Arg He Ser Tyr Gly Asn Asp Ala Leu Met Pro 
570 575 

Lys Thr Thr Val Glu Leu Leu Pro Val Asn Gly 
585 590 

Asp Leu Gin Pro Trp His Ser Phe Gly Ala Asp 
600 605 

Thr Glu Asn Glu Val Glu Pro Val Asp Ala Arg 
615 620 

Gly Leu Thr Thr Arg Pro Gly Ser Gly Leu Thr 
630 635 640 

Glu He Ser Glu Val Lys Met Asp Ala Glu Phe 
650 655 

Tyr Glu Val His His Gin Lys Leu Val Phe Phe 
665 670 

Ser Asn Lys Gly Ala He He Gly Leu Met Val 
680 685 

Ala Thr Val He Val He Thr Leu Val Met Leu 
695 700 

Thr Ser He His His Gly Val Val Glu Val Asp 
710 715 720 

Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 
730 735 

Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
745 750 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 770 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 1° 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 4 5 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 9° 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 I 10 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 16° 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
* 165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 I 90 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
!95 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 2 1 5 220 
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Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Glu Val Cys Ser Glu Gin Ala Glu Thr Gly Pro Cys Arg Ala Met He 
290 295 300 



Ser Arg Trp Tyr Phe Asp Val Thr Glu Gly Lys Cys Ala Pro Phe Phe 
305 310 315 320 

Tyr Gly Gly Cys Gly Gly Asn Arg Asn Asn Phe Asp Thr Glu Glu Tyr 



325 



330 



Cvs Met Ala Val Cys Gly Ser Ala Met Ser Gin Ser Leu Leu Lys Thr 
340 345 350 

Thr Gin Glu Pro Leu Ala Arg Asp Pro Val Lys Leu Pro Thr Thr Ala 
355 360 365 

Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu Glu Thr Pro Gly Asp 
370 375 380 

Glu Asn Glu His Ala His Phe Gin Lys Ala Lys Glu Arg Leu Glu Ala 
385 390 395 400 

Lys His Arg Glu Arg Met Ser Gin Val Met Arg Glu Trp Glu Glu Ala 
■ - ~ 415 



405 



410 



Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp Lys Lys Ala Val He 
420 425 430 

Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu Gin Glu Ala Ala Asn 
435 440 445 

Glu Arg Gin Gin Leu Val Glu Thr His Met Ala Arg \fal Glu Ala Met 
450 455 460 

Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn Tyr He Thr Ala Leu 
465 470 475 480 

Gin Ala Val Pro Pro Arg Pro Arg His Val Phe Asn Met Leu Lys Lys 
485 490 495 

Tvr Val Arg Ala Glu Gin Lys Asp Arg Gin His Thr Leu Lys His Phe 
500 505 51° 

Glu His Val Arg Met Val Asp Pro Lys Lys Ala Ala Gin He Arg Ser 
515 520 525 
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Gin Val Met Thr His Leu Arg Val lie Tyr Glu Arg Met Asn Gin Ser 
530 535 540 

Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala Glu Glu He Gin Asp 
545 550 555 560 

Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn Tyr Ser Asp Asp Val 
565 570 575 

Leu Ala Asn Met He Ser Glu Pro Arg He Ser Tyr Gly Asn Asp Ala 
580 585 590 

Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr Val Glu Leu Leu Pro 
595 600 605 

Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin Pro Trp His Ser Phe 
610 615 620 

Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val 
625 630 635 640 

Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser 
645 650 655 

Gly Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp 
660 665 670 

Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val His His Gin Lys Leu 
675 680 685 

Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys Gly Ala He He Gly 
690 695 700 

Leu Met Val Gly Gly Val Val He Ala Thr Val He Val He Thr Leu 
705 710 715 720 

Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val 
725 730 735 

Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met 
740 745 750 

Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met 

755 760 765 



Gin Asn 
770 



WO 92/13069 PCT/GB92/00123 



47 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


GCTGGAGGTA 


60 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


TTGCCATGTT 


CTGTGGCAGA 


120 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


180 


ACCTGCATTG 


ATACCAAGGA 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


240 


CAGATCACCA 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


inn 


GGCCGCAAGC 


AGTGCAAGAC 


MM » r*« AM M M m 

CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 




^ fin 


GAGTTTGTAA 


GTGATGCCCT 


TCTCGTTCCT 


G ACAAGTG C A 


AATTCTTACA 


UUA(jUA<jA*jLt 


** rf* U 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


CACACCG TCG 


L. L. AAAb AVj AC 




/an 
■* o w 


AAGAGTACCA 


ACTTGCATGA 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


540 


GGGGTAGAGT 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


600 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


TGCAGATGGG 


660 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


TGGCTGAGGT 


GGAAGAAGAA 


720 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


780 


CCCTACGAAG 


AAGCCACAGA 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


840 


GAGTCTGTGG 


AAGAGGTGGT 


TCGAGTTCCT 


ACAACAGCAG 


CCAGTACCCC 


TGATGCCGTT 


900 


GACAAGTATC 


TCGAGACACC 


TGGGGATGAG 


AATGAACATG 


CCCATTTCCA 


GAAAGCCAAA 


960 


GAGAGGCTTG 


AGGCCAAGCA 


CCGAGAGAGA 


ATGTCCCAGG 


TCATGAGAGA 


ATGGGAAGAG 


1020 


GCAGAACGTC 


AAGCAAAGAA 


CTTGCCTAAA 


GCTGATAAGA 


AGGCAGTTAT 


CCAGCATTTC 


1080 


CAGGAGAAAG 


TGGAATCTTT 


GGAACAGGAA 


GCAGCCAACG 


AGAGACAGCA 


GCTGGTGGAG 


1140 


ACACACATGG 


CCAGAGTGGA 


AGCCATGCTC 


AATGACCGCC 


GCCGCCTGGC 


CCTGGAGAAC 


1200 


TACATCACCG 


CTCTGCAGGC 


TGTTCCTCCT 


CGGCCTCGTC 


ACGTGTTCAA 


TATGCTAAAG 


1260 
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AAGTATGTCC 


GCGCAGAACA 


GAAGGACAGA 


CAGCACACCC 


TAAAGCATTT 


CGAGCATGTG 


1320 


CGCATGGTGG 


ATCCCAAGAA 


AGCCGCTCAG 


ATCCGGTCCC 


AGGTTATGAC 


ACACCTCCGT 


1380 


GTGATTTATG 


AGCGCATGAA 


TCAGTCTCTC 


TCCCTGCTCT 


ACAACGTGCC 


TGCAGTGGCC 


1440 


GAGGAGATTC 


AGGATGAAGT 


TGATGAGCTG 


CTTCAGAAAG 


AGCAAAACTA 


TTCAGATGAC 


1500 


GTCTTGGCCA 


ACATGATTAG 


TGAACCAAGG 


ATCAGTTACG 


GAAACGATGC 


TCTCATGCCA 


1560 


TCTTTGACCG 


AAACGAAAAC 


CACCGTGGAG 


CTCCTTCCCG 


TGAATGGAGA 


GTTCAGCCTG 


1620 


GACGATCTCC 


AGCCGTGGCA 


TTCTTTTGGG 


GCTGACTCTG 


TGCCAGCCAA 


CACAGAAAAC 


1680 


GAAGTTGAGC 


CTGTTGATGC 


CCGCCCTGCT 


GCCGACCGAG 


GACTGACCAC 


TCGACCAGGT 


1740 


TCTGGGTTGA 


CAAATATCAA 


GACGGAGGAG 


ATCTCTGAAG 


TGAAGATGGA TGCAGAATTC 


1800 


CGACATGACT 


CAGGATATGA 


AGTTCATCAT 


CAAAAATTGG 


TGTTCTTTGC 


AGAAGATGTG 


1860 


GGTTCAAACA 


AAGGTGCAAT 


CATTGGACTC 


ATGGTGGGCG 


GTGTTGTCAT 


AGCGACAGTG 


1920 


ATCGTCATCA 


CCTTGGTGAT 


GCTGAAGAAG 


AAACAGTACA 


CATCCATTCA 


TCATGGTGTG 


1980 


GTGGAGGTTG 


ACGCCGCTGT 


CACCCCAGAG 


GAGCGCCACC 


TGTCCAAGAT 


GCAGCAGAAC 


2040 


GGCTACGAAA 


ATCCAACCTA 


CAAGTTCTTT 


GAGCAGATGC 


AGAACTAG 




2088 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATGCTGCCCG 


GTTTGGCACT 


GCTCCTGCTG 


GCCGCCTGGA 


CGGCTCGGGC 


GCTGGAGGTA 


60 


CCCACTGATG 


GTAATGCTGG 


CCTGCTGGCT 


GAACCCCAGA 


TTGCCATGTT 


CTGTGGCAGA 


120 


CTGAACATGC 


ACATGAATGT 


CCAGAATGGG 


AAGTGGGATT 


CAGATCCATC 


AGGGACCAAA 


180 


ACCTGCATTG 


ATACCAAGGA 


AGGCATCCTG 


CAGTATTGCC 


AAGAAGTCTA 


CCCTGAACTG 


240 


CAGATCACCA 


ATGTGGTAGA 


AGCCAACCAA 


CCAGTGACCA 


TCCAGAACTG 


GTGCAAGCGG 


300 


GGCCGCAAGC 


AGTGCAAGAC 


CCATCCCCAC 


TTTGTGATTC 


CCTACCGCTG 


CTTAGTTGGT 


360 


GAGTTTGTAA 


GTGATGCCCT 


TCTCGTTCCT 


GACAAGTGCA 


AATTCTTACA 


CCAGGAGAGG 


420 


ATGGATGTTT 


GCGAAACTCA 


TCTTCACTGG 


CACACCGTCG 


CCAAAGAGAC 


ATGCAGTGAG 


480 


AAGAGTACCA 


ACTTGCATGA 


CTACGGCATG 


TTGCTGCCCT 


GCGGAATTGA 


CAAGTTCCGA 


540 


GGGGTAGAGT 


TTGTGTGTTG 


CCCACTGGCT 


GAAGAAAGTG 


ACAATGTGGA 


TTCTGCTGAT 


600 


GCGGAGGAGG 


ATGACTCGGA 


TGTCTGGTGG 


GGCGGAGCAG 


ACACAGACTA 


TGCAGATGGG 


660 


AGTGAAGACA 


AAGTAGTAGA 


AGTAGCAGAG 


GAGGAAGAAG 


TGGCTGAGGT 


GGAAGAAGAA 


720 


GAAGCCGATG 


ATGACGAGGA 


CGATGAGGAT 


GGTGATGAGG 


TAGAGGAAGA 


GGCTGAGGAA 


780 


CCCTACGAAG 


AAGCCACAGA 


GAGAACCACC 


AGCATTGCCA 


CCACCACCAC 


CACCACCACA 


840 


GAGTCTGTGG 


AAGAGGTGGT 


TCGAGAGGTG 


TGCTCTGAAC 


AAGCCGAGAC 


GGGGCCGTGC 


900 


CGAGCAATGA 


TCTCCCGCTG 


GTACTTTGAT 


GTGACTGAAG 


GGAAGTGTGC 


CCCATTCTTT 


960 


TACGGCGGAT 


GTGGCGGCAA 


CCGGAACAAC 


CGGAACAACT 


TTGACACAGA 


AGAGTACTGC 


1020 


ATGGCCGTGT 


GTGGCAGCGC 


CATTCCTACA 


ACAGCAGCCA 


GTACCCCTGA 


TGCCGTTGAC 


1080 


AAGTATCTCG 


AGACACCTGG 


GGATGAGAAT 


GAACATGCCC 


ATTTCCAGAA 


AGCCAAAGAG 


1140 


AGGCTTGAGG 


CCAAGCACCG 


AGAGAGAATG 


TCCCAGGTCA 


TGAGAGAATG 


GGAAGAGGCA 


1200 


GAACGTCAAG 


CAAAGAACTT 


GCCTAAAGCT 


GATAAGAAGG 


CAGTTATCCA 


GCATTTCCAG 


1260 
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GAGAAAGTGG 


AATCTTTGGA 


ACAGuAAu LA 




GACAGCAGCT 


GGTGGAGACA 


1320 


CACATGGCCA 


GAGTGGAAGC 


r% » nyf* t ** PP ^ ft 1* 
CATC? CT LAA X 




GCCTGGCCCT 


GGAGAACTAC 


1380 


ATCACCGCTC 


TGCAGGCTGT 


TCCTCCTCGG 




TGTTCAATAT 

X w X X wAA X A X 


GCTAAAGAAG 


1440 


TATGTCCGCG 


CAGAACAGAA 


GGAGAGACAl? 




AGCATTTCGA 


GCATGTGCGC 


1500 


ATGGTGGATC 


CCAAGAAAGC 


CGCTCAGAi v- 


WUU X CV-wiwV? 


TTATGACACA 


CCTCCGTGTG 


1560 


ATTTATGAGC 


GCATGAATCA 


GTCTCTCTCC 




A CGTGCCTGC 

AWw X www A ww 


AGTGGCCGAG 


1620 


GAGATTCAGG 


ATGAAGTTGA 


TGAGCTG CTT 


LAbAAnunVj w 


AAAACTATTC 

AAAAv X A X x w 


AGATGACGTC 


1680 


TTGGCCAACA 


TGATTAGTGA 


ACCAAGGATC 


* /^rprpft rTT* ft A 
AC? 1 X AwLsuAA 


ACGATGCTCT 


CATGCCATCT 


1740 


TTGACCGAAA 


CGAAAACCAC 


CGTGGAGCTC 


CTX X LLLbJ. Lj A 


ATGGAGAGTT 


CAGCCTGGAC 


1800 


GATCTCCAGC 


CGTGGCATTC 


TTTTGGGGCT 


G ACT CTG x C* w 


CAGCCAACAC 


AGAAAACGAA 


1860 


GTTGAGCCTG 


TTGATGCCCG 


CCCTGCTGCC 


GAwwUAvjLjAw 


TGACCACTCG 


ACCAGGTTCT 


1920 


GGGTTGACAA 


ATATCAAGAC 


GGAGGAGATC 


TCTGAAGTGA 


AGATGGATGC 


AGAATTCCGA 


1980 


CATGACTCAG 


GATATGAAGT 


TCATCATCAA 


AAATTGGTGT 


TCTTTGCAGA AGATGTGGGT 


2040 


TCAAACAAAG 


GTGCAATCAT 


TGGACTCATG 


GTGGGCGGTG 


TTGTCATAGC 


GACAGTGATC 


2100 


GTCATCACCT 


TGGTGATGCT 


GAAGAAGAAA 


CAGTACACAT 


CCATTCATCA 


TGGTGTGGTG 


2160 


GAGGTTGACG 


CCGCTGTCAC 


CCCAGAGGAG 


CGCCACCTGT 


CCAAGATGCA 


GCAGAACGGC 


2220 


TACGAAAATC 


CAACCTACAA 


GTTCTTTGAG 


CAGATGCAGA 


ACTAG 




2265 
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(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

lie Ala Thr Val lie Xaa lie Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

He Ala Thr Val He Gly He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

He Ala Thr Val He Met He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

He Ala Thr Val He Ala He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

He Ala Thr Val He Ser He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

He Ala Thr Val He He He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

lie Ala Thr Val lie Leu lie Thr Leu 
1 5 



, _ PCT/GB92/00123 
WO 92/13069 



58 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

He Ala Thr Val He Thr He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

He Ala Thr Val He Pro He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

He Ala Thr Val He His He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

He Ala Thr Val He Cys He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

He Ala Thr Val He Tyr He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

He Ala Thr Val He Phe He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

He Ala Thr Val He Glu He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

He Ala Thr Val He Trp He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

He Ala Thr Val He Arg He Thr Leu 
1 5 
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) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

He Ala Thr Val He Asp He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

He Ala Thr Val He Asn He Thr Leu 
1 5 
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INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

lie Ala Thr Val lie Lys lie Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

He Ala Thr Val He Gin He Thr Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CCTCATCCAA ATGTCCCCGT CATT 



24 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCCTAATTCT CTCATAGTCT TAATTCCCAC 
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
ATAACCTCAT CCAAATGTCC CC 
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(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GTAACCCAAG CATCATGGAA GC 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAATGAAATT CTTCTAATTG CG 
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(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CAGACTGAGA TCCCAGAACC CTAGGTCTGA CTCTAGGGTC TTGG 
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(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (Primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CAGACTGAGA TCCCAGAACC GATCCTAGGT CTGACTCTAG GGTCTTGG 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CGACCAGGTT GTGGGTTGAC AAATA 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
AATCTATTCA TGCACTAGTT TGATACAGC 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ACAGTGATCA TCATCACCTT G 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 6: 
CAAGGTGATG ATGATCACTG T 
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(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
AGCGACAGTG ATCGGCATCA CCTTGGTG 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CACCCAGGTG ATGCCGATCA CTGTCGCT 
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(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
ACCCACATCT TGTGCAAAGA ACAC 
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(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GTGTTCTTTG CACAAGATGT GGGT 
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(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CCCAGCCATC ATGCTGCCCG GGTTGGC 
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(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
GCCAACCCGG GCAGCATGAT GACTGGGATC TC 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACCTGCCACT ATACTGGAAT A 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
TGTGCATGTT CAGTCTGCCA C 
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wpAT TS HT.ATMEDIS: 

1. An isolated polynucleotide comprising a nucleic acid sequence 
encoding a codon 717 mutant of human amyloid precursor protein 770 (APP770), 

5 or an isoform or fragment of APP770 having a mutant amino acid residue at the 
position encoded by codon 717. 

2. An isolated polynucleotide of claim 1, wherein the amino acid 
at the position encoded by codon 717 is an isoleucine, glycine, or phenylalanine. 

10 

3. An isolated polynucleotide of claim 1, wherein the nucleic acid 
sequence is a cDNA. 

4. A composition comprising a polynucleotide probe capable of 
15 specifically hybridizing to an amyloid precursor protein 770 (APP770) allele 

exhibiting a mutation at codon 717. 

5. A composition of claim 4, wherein codon 717 of the mutant 
allele encodes an isoleucine, phenylalanine, or glycine. 



20 



25 



6. A composition of claim 4, wherein the probe is labeled. 

7. A composition of claim 4, wherein the probe comprises at 
least about 10 nucleotides spanning amino acid 717 of the APP770 allele. 

8. A transgenic host comprising a nucleic acid segment 
encoding a position 717 mutant of human amyloid precursor protein 770 
(APP770), or an APP770 isoform or fragment of APP770 having the mutation. 
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9. A host of claim 8, which is a primary or immortalized 
eukaryotic cell line. 

10. A host of claim 8, which is a bacterium. 

5 

11. A host of claim 8, wherein the segment is integrated into the 

host genome. 

12. A host of claim 8, which is a non-human animal having the 
10 DNA segment incorporated into its germline and which is capable of expressing 

the mutant APP770 protein. 



13. A host of claim 12, wherein the mutant APP770 protein is 
the sole APP770 protein produced by the animal. 

15 

14. A transgenic non-human animal with germ cells or somatic 
cells comprising a heterologous gene encoding a position 717 mutant amyloid 
precursor protein 770 (APP770), which gene upon expression promotes 
neuropathological characteristics of Alzheimer's disease in the animal. 

20 

15. A cultured human primary or immortalized cell, comprising 
a nucleic acid segment encoding a position 717 mutant of human amyloid precursor 
protein 770 (APP770), or an APP770 isoform or fragment of APP770 having the 
mutation. 

25 

16. A method of screening for an agent capable of treating 
Alzheimer's disease, comprising: 

contacting a host of claim 8 with the agent; and monitoring 
expression or processing of proteins encoded by the mutant APP770 gene. 
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17. A diagnostic method for determining an inherited 
predisposition to Alzheimer's disease in a subject, comprising detecting in the 
subject the presence of an allele of amyloid precursor protein (APP), an isoform 
or fragment thereof, wherein said allele has a sequence polymorphism at a position 

5 corresponding to codon 717 of APP770. 

18. A method of claim 17, wherein said sequence polymorphism 
is a nucleotide substitution, whereby an isoleucine or glycine is substituted at 
codon 717 of APP770. 

10 

19. A method of claim 17, wherein said sequence polymorphism 
is a single nucleotide substitution. 

20. A method according to claim 17, wherein the detecting step 
15 comprises sequencing a genomic DNA segment from chromosome 21 of the 

subject. 

21. A method according to claim 17, wherein the detecting step 
comprises (i) mixing a nucleic acid sample from the subject with one or more 

20 polynucleotide probes capable of hybridizing selectively to an APP gene allele in 
a reaction and (ii) monitoring the reaction to determine the presence of the gene 
allele in the sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease. 

25 22. A method according to claim 20, wherein one probe is a 

polynucleotide comprising a sequence of at least about 10 nucleotides spanning 
codon 717 of an APP770. 
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23. A method according to claim 22, wherein the probes are 
oligonucleotides capable of priming polynucleotide synthesis in a polymerase chain 
reaction, wherein a reaction product comprises a sequence of at least 25 contiguous 
nucleotides from exon 17 of the APP gene. 

5 

24. A method according to claim 21, wherein at least one 
oligonucleotide specifically hybridizes to a sequence present in an intron or 
flanking region of an APP770 gene. 

10 25. A method according to claim 21 , wherein the monitoring step 

comprises analyzing sequencing gel reaction products from the PGR reaction. 

26. A method according to claim 21 , wherein the monitoring step 
comprises analyzing an autoradiograph of a Bell digest of reaction products from 

15 the PCR reaction. 

27. A method according to claim 17, wherein the detecting step 
comprises (i) mixing in an immunological assay an APP770 or isoform protein 
sample from the subject with an antibody reagent specific for the allele and 

20 (ii) monitoring the assay to determine specific binding between the antibody 
reagent and the protein sample, thereby indicating whether the subject is at risk for 
Alzheimer's disease. 

28. A method according to claim 27, wherein the antibody 
2 5 reagent is a monoclonal antibody specifically reactive with an antigenic determinant 

specific for an allele. 

29. A method for genetic analysis of a human subject which 
comprises detecting the presence or absence of at least one polymorphism at codon 

30 717 of an APP770 gene of an amyloid precursor protein (APP) gene in the subject. 



WO 92/13069 PCT/GB92/00123 



94 

30. A method according to claim 29, wherein the polymorphism 
is detected by digesting genomic DNA from the subject with at least one restriction 
endonuclease and hybridizing resulting fragments with a detecting probe 

5 31. A composition comprising a polypeptide free from human 

proteins, comprising a core sequence: 

Ile-Ala-Thr-Val-Ile-X-ne-Thr-Leu- [SEQ ID NO:6] 
wherein X is any of the twenty conventional amino acids except valine. 

10 32. A transgenic nonhuman animal containing a polypeptide of 

claim 31. 

33. A transgenic nonhuman animal of Claim 32, wherein the 
polypeptide is present in the brain. 

15 

34. An isolated polynucleotide, comprising a nucleic acid 
sequence encoding a mutant human APP allele that cosegregates with a genetic 
predisposition to Alzheimer's disease. 

20 35. An isolated polypeptide of Claim 34, wherein said mutant 

human APP allele comprises a codon 717 mutant. 

36. A method of determining a genetic predisposition of a subject 
to Alzheimer's disease, the method comprising detecting in the subject's DNA the 

25 presence of an allele of a gene encoding amyloid precursor protein (APP). 

37. A method as claimed in claim 36, wherein the step of 
detection is carried out on material removed from, and not returned to, the 
subject's body. 
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38. A method as claimed in claim 36 or 37, wherein the allele 
of the gene encodes a substitution mutant of APP. 

39. A method as claimed in claim 38, wherein a single amino 
5 acid is substituted for another. 
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